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NONLINEAR FRAMES AND SPARSE RECONSTRUCTIONS IN 

BANACH SPACES 

QIYU SUN AND WAI-SHING TANG 


Abstract. In the first part of this paper, we consider nonlinear extension of 
frame theory by introducing bi-Lipschitz maps F between Banach spaces. Our 
linear model of bi-Lipschitz maps is the analysis operator associated with Hilbert 
frames, p-frames, Banach frames, g-frames and fusion frames. In general Banach 
space setting, stable algorithm to reconstruct a signal x from its noisy measure¬ 
ment F(x) 4- e may not exist. In this paper, we establish exponential conver¬ 
gence of two iterative reconstruction algorithms when F is not too far from some 
bounded below linear operator with bounded pseudo-inverse, and when F is a 
well-localized map between two Banach spaces with dense Hilbert subspaces. 
The crucial step to prove the later conclusion is a novel fixed point theorem for 
a well-localized map on a Banach space. 

In the second part of this paper, we consider stable reconstruction of sparse 
signals in a union A of closed linear subspaces of a Hilbert space H from their 
nonlinear measurements. We create an optimization framework called sparse 
approximation triple (A, M, H), and show that the minimizer 

x = argmin 4gM with ||F(*)-F(x°)||<ell*llM 

provides a suboptimal approximation to the original sparse signal x° £ A when 
the measurement map F has the sparse Riesz property and almost linear prop¬ 
erty on A. The above two new properties is also discussed in this paper when 
F is not far away from a linear measurement operator T having the restricted 
isometry property. 


1. Introduction 

For a Banach space B, we denote its norm by || • ||b- A map F from one Banach 
space Bi to another Banach space Bo is said to have bi-Lipschitz property if there 
exist two positive constants A and B such that 

(1.1) A\\x- y\\ Bl < \\F(x) - F(y)\\ B2 < B\\x-y\\ Bl for all x, y G Bi. 

Our models of bi-Lipschitz maps between Banach spaces are analysis operators 
associated with Hilbert frames, p-frames, Banach frames, g-frames and fusion 
frames mm ns m in- Our study is also motivated by nonlinear sampling 
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theory and phase retrieval, which have gained substantial attention in recent years 

The framework developed in the first part of this paper 
could be considered as a nonlinear extension of frame theory. 

Denote by B{ Bi, B 2 ) the Banach space of all bounded linear operators from one 
Banach space Bi to another Banach space B 2 . A continuous map F from Bi to 
B 2 is said to be differentiable at x £ Bi if there exists a linear operator, denoted 
by F'(x), in £>(Bi,B 2 ) such that 

r ll-F(a + V) ~ F (x) - F'(x)y\\ B2 _ 
v- 0 ||y|| Bl 

and to be differentiable on Bi if it is differentiable at every x £ Bi |20| . For 
a differentiable bi-Lipschitz map F from Bi to B 2 , one may easily verify that 
its derivatives F'(x),x £ Bi, are uniformly stable, i.e., there exist two positive 
constants A and B such that 


( 1 . 2 ) 


A||y|| Bl < \\F'(x)y\\-B 2 < -B||y|| Bl for all x,y £ Bi. 


The converse is not true in general. Then we have the following natural question. 
Question 1: When does a differentiable map with the uniform stability property 
(1.2) have the bi-Lipschitz property (1.1)? 

We say that a linear operator T £ £>(Bi,B 2 ) from one Banach space Bi to 
another Banach space B 2 is bounded below if 

\\Ty Hb 2 


(1.3) 


inf 


II2/II 


> 0 . 


For a continuously differentiable map F not too nonlinear, particularly not far 
away from a bounded below linear operator T, a sufficient condition for (1.1) is 
that for any 0 / y £ Bi, the set B(y) of unit vectors F'(x)y/\\F'(x)y\\-B 2 ,x £ Bi, 
is contained in a ball of radius 


(1.4) 

with center at Ty/||Ty|| B2 , where 

(1.5) [3f.t '■= sup sup 

O^yGBi i£Bi 


Pf,t < 1 


F'{x)y 

Ty 

\\F'(x)y\\ B2 

\\Ty ||b 2 


The above geometric requirement on the radius @f,t is optimal in Banach space 
setting, but it could be relaxed to 

( 1 . 6 ) /3f,t < V2 

in Hilbert space setting, which implies that for any 0 y £ Bi, the set B(y) is 
contained in a right circular cone with axis Ty/||Ty|| B2 and angle strictly less than 
7r/2. Detailed arguments of the above conclusions on a differentiable map are given 
in Appendix [Aj 

Denote by F’(Bi) C B 2 the image of a map F from one Banach space Bi to 
another Banach space B 2 . For a bi-Lipschitz map F : Bi —> B 2 , as it is one-to-one, 
for any y £ F( Bi) there exists a unique x £ Bi such that F(x) = y. Our next 
question is as follows: 
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Question 2: Given noisy observation z e = F(x u ) + e of x° E Bi corrupted by 
e E B 2 , how to construct a suboptimal approximation x E Bi such that 

(1.7) \\x - x°|| Bl < C'llellsa, 

where C is an absolute constant independent of x° E Bi and e E B 2 ? 

For a differentiable bi-Lipschitz map F not far away from a bounded below linear 
operator T, define x n ,n > 0, iteratively with arbitrary initial xq E Bi by 

(1.8) x n+ i = x n - pT\F(x n ) - z e ), n > 0, 

where is a bounded left-inverse of the linear operator T, and the relaxation 
factor n satisfies 0 < p < (sup xeBl sup ;y _^ 0 ||-F / (aOy|| B2 /||T , ?/||] 


\-i 


In Theorem 2.1 


of Section [2J we show that the sequence x n ,n > 0, in the iterative algorithm ( |1.8[ ) 
converges exponentially to a suboptimal approximation element 1 EB 1 satisfying 


(1.7), provided that 


\-i 


(I- 9 ) Pf,T < (l| r llB(Bi,B 2 )ll^llB(B2,Bi)) 

The above requirement ( |1.9[ ) about /3f,t to guarantee convergence of the iterative 
algorithm (1.8) is stronger than the sufficient condition (1.4) for the bi-Lipschitz 


property of the map F. In Theorem 2.2, we close that requirement gap on /3f,t i n 


Hilbert space setting by introducing an iterative algorithm of Van-Cittert type, 


( 1 . 10 ) 


u n+ i = u n - pT*(F(u n ) - z £ ), n > 0, 


where T* is the conjugate of the linear operator T and p > 0 is a small relaxation 
factor. 

In the iterative algorithm (1.8), a left-inverse Tt of the bounded below linear op¬ 


erator T is used, but its existence is not always assured in Banach space setting and 
its construction is not necessarily attainable even it exists. This limits applicability 


of the iterative reconstruction algorithm (1.8). In fact, for general Banach space 
setting, a stable reconstruction algorithm may not exist m ITS] . On the other 
hand, a stable iterative algorithm is proposed in (48] to find sub-optimal approxi¬ 


mation for well-localized nonlinear maps on sequence spaces t? p (Z), 2 < p < 00 . So 
we have the following question. 

Question 3: For what types of Banach spaces Bi and Bo and nonlinear maps F 
from Bi to Bo does there exist a stable reconstruction of x° E Bi from its nonlinear 
observation y = F(x°) E Bo ? 

We say that a Banach space B with norm || ■ || B is Hilbert-dense (respectively 
weak-Hilbert-dense ) if there exists a Hilbert subspace H C B with norm || ■ ||h such 
that H is dense in B in the strong topology (respectively in the weak topology) of 
B and 

IM|b 


sup 


x H 


< OO. 


Our models of the above new concepts are the sequence spaces £ p , 2 < p < 00 , for 
which i v with 2 < p < 00 are Hilbert-dense and £°° is weak-Hilbert-dense. For 
(weak-)Hilbert-dense Banach spaces Bi and B 2 and a nonlinear map F : Bi —> B 2 
that has certain localization property, a stable reconstruction of x° E Bi from its 

of Section [3| 


nonlinear observation y = F(x°) E B 2 is proposed in Theorem 


3.1 
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The crucial step is a new fixed point theorem for a well-localized differentiable map 


whose restriction on a dense Hilbert subspace is a contraction, see Theorem 3.2 


Let Hi and H 2 be Hilbert spaces, and let A = Uj g /Aj be union of closed linear 
subspaces A i,i E I , of the Hilbert space Hi. The second topic of this paper is 
to study the restricted bi-Lipschitz property of a map F : Hi —> H 2 on A, which 
means that there exist two positive constants A and B such that 

(1.11) A\\x - y[| Hl < \\F(x) - F(y)\\ H2 < B\\x - y\\ Ul for all x, y E A. 

This topic is motivated by sparse recovery problems on finite-dimensional spaces 
da m Eg E6]. As we use the union A of closed linear spaces A j,i E I, to model 
sparse signals, the restricted bi-Lipschitz property of a map F could be thought as 
nonlinear correspondence of restricted isometric property of a measurement matrix. 
So the framework developed in the second part is nonlinear Banach space extension 
of the finite-dimensional sparse recovery problems. 

In the classical sparse recovery setting da OH, .231 E2], the set of all s-sparse 
signals for some s > 1 is used as the set A. In this case, elements in A can be 
described by their t°-quasi-norms being less than or equal to s, and the sparse 
recovery problem could reduce to the ^-minimization problem. Due to numerical 
infeasibility of the ^-minimization, a relaxation to (non-)convex ^-minimization 
with 0 < q < 1 was proposed, and more importantly it was proved that the l q - 
minimization recovers sparse signals when the linear measurement operator has 
certain restricted isometry property in f 2 \1% \27\ [2S1 S3- This leads to the 

following question. 

Question 4: How to create a general optimization framework to recover sparse 
signals? 

Given a Banach space M, we say that a subset K of M is proximinal ( }10l '38]) 
if every element x E M has a best approximator y E K , that is, 

||® - 2/||m = inf ||x - z|| M =: vk, m(®)- 

zGK 

Given Hilbert spaces Hi and H 2 , a union A = Uj g /Ai of closed linear subspaces 
A i,i E /, of Hi, and a continuous map F from Hi to H 2 , consider the following 
minimization problem in a Banach space M, 

(!- 12 ) x* = argmin^M with F(3 , )= Jx|| M 

for any given observation 2 := F{x) for some x E A. To make the above minimiza¬ 
tion problem suitable for stable reconstruction of x E A from its observation F(x), 
we introduce the concept of a sparse approximation triple (A, M, Hi): 

(i) (Continuous imbedding property) The Banach space M contains all ele¬ 
ments in A and it is contained in the Hilbert space Hi, that is, 

(1.13) A CMC Hi, 

and the imbedding operators ix '■ A —> M and : M —> Hi are bounded. 

(ii) (Proximinality property) The Banach space M has A as its closed subset, 
and all closed subsets of M being proximinal. 
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(iii) (Common-best-approximator property) Given any i E I, a best approxima¬ 
tor SAj,M := argmin £eA . \\x — x||m of a; G M in the norm || ■ ||m is also a 
best approximator in the norm || ■ ||hi, that is, 

(1.14) x Ai ,M = argmin £gA J|x - ®||hi- 

(iv) (Norm-splitting property) For the best approximator x a ..m of x G M in 
the norm || • ||m, 

(1-15) \\x\\m = ||®A;,m||m + \\x — ®Aj,M ||m> 

and 

(1-16) IMIhi = II^a^mIIhj + ||® - ®A;,m||h 1 - 

(v) (Sparse density property) L)k>ikA is dense in Hi, where 

k 

kA := A + A + • • • + A = | Xj : x\,... ,Xk G A j, k> 1. 

k times 

One may easily verify that these five properties are satisfied for the triple (A, M, Hi) 
in the classical sparse recovery setting, where A is the set of all s-sparse vectors, 

M is the set of all summable sequences, and Hi is the set of all square-summable 
sequences mmmm- 

In this paper, we rescale the norm || • ||m in the sparse approximation triple 
(A, M, Hi) so that the imbedding operator im has norm one, 

(1.17) 

otherwise replacing it by || • 
measure sparsity 

(1.18) s A : = 
for signals in A, and sparse 

(1.19) a A 

for elements in M, where x a ,m and u a ,m G A are the first and second best 
approximators of x respectively, 


IKm||b(M,Hi) — 1 > 

|m||*m||b(m,Hi)- Next we introduce two quantities to 


IfIIm 

Fa|Ib(a,m) _ [ SU P || II 

' 0/o:GA ||®||Hi 
approximation ratio 


: = sup 


Fa.mIIhi 

||®a,m||m 


< 1 


(1.20) ||a? — ®a,m||m — <®a,m(®) and II® ~ ®a,m — «a.m||m — <®a,m(® — ®a,m)- 
The upper bound estimate in (1.19) holds, since 

II^A.mIIm = II® - ®A,m||m - II® — ®A,M — RA.mIIm 

< \\x — U Aj m||m — ||® — ®A,M — «A,m||m < ||®A,m||m, ® G M, 


by the norm splitting property (1.15). In the classical sparse recovery setting with 
A being the set of all s-sparse signals, one may verify that s A = s and a A = 1/s, 
see Appendix |B] for additional properties of sparse approximation triples. 


Having introduced the sparse approximation triple (A, M, Hi), our next ques¬ 
tion is on optimization approach to sparse signal recovery, see B EU3 El El M EE 
[39] for the classical setting. 
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Question 5: Given a sparse approximation triple (A, M, Hi), for what type of 
maps F : Hi —> H 2 does the solution x^ of the optimization problem 

(1-21) := al 'g m i n .T 6 M with ||F(£)-F( 3 : 0 )||<£||®IIm, 

is a suboptimal approximation to the sparse signal x° in A? 

In this paper, without loss of generality, we assume that F( 0) = 0. We say that 
F : Hi —> Ho has sparse Riesz property if 

( 1 . 22 ) ||F(x)||h 2 > H _ 1 (||x|| Hl - Py/aX a a, m(®)), x G M, 

with D, (3 > 0, and that F is almost linear on A if 

(1.23) 

\\F(x)-F(y)-F(x-y)\\u 2 < 'yi\\x-y\\u 1 + r y 2 V cl A( cr A,M{x)+(J AM (y)), x,y G M, 
with 71,72 > 0. Combining the sparse Riesz property and almost linear property 
of a map F gives 

\\F(x) - F(y)\\ n2 > (IT 1 - 7 i)||x-y|| Hl 
(1-24) -(H _1 /3 + 72 )\/oa(cta,m(®) + o’A.M(y)), x,y e M, 

and hence F has the restricted bi-Lipschitz property on A, 

II F{x) - F(y) ||h 2 > ( D~ l - 7 i)||x - y|| Hl , x,y € A, 

when 71 and D satisfy H 71 < 1. In Section [ 4 J we show that the solution x^- of the 
optimization problem (1.21) is a suboptimal approximation to the signal x° in M, 
i.e., there exist positive constants C\ and C 2 such that 

(1.25) ||xm - a : 0 1|Hi < Ci^aXa AM {x°) + C 2 e, 

provided that F has the sparse Riesz property ( |1.22[ ) and almost linear property 
(1.23) with D,/3 ,71 and 72 satisfying 

1 - 2H71 - (H71 + H72 + ff)y/a A SA > 0. 

We remark that the approximation error estimate (1.25) implies the sparse Riesz 
property (1.22) for the map F, 

||i^)||H 2 > C'^ 1 (||®||hi - C , l\/oACr Ai M( x)), X G M, 

which follows from (1.25) by taking x° = x and e = ||R 1 (x 0 )||h 2 - 

The sparse Riesz property was introduced in m with a different name, sparse 
approximation property, for the classical sparse recovery setting; and the almost 
linear property was studied in [281134] for bi-Lipschitz maps between Banach spaces. 
In Section [5] we consider the following question. 

Question 6 : When does a map F : Hi —» H 2 have the sparse Riesz property (1.22) 
and the almost linear property ( 1.23[ ) ? 

We say that a linear operator T G £>(Hi, H 2 ) has the restricted isometry property 
(RIP) on 2A if 

(1.26) (1 - <5 2 a(T))||z||h 1 < ||^||h 2 < (1 + 6 2A (T))\\z\\ 2 Ul for all z € 2A, 

where <5 2 a(T) G [0,1) |12) 14). A nonlinear map F : Hi 1 — > H 2 , not far away from 
a linear operator T with the restricted isometry property (1.26) in the sense that 

7 .f,t(2 A) < \J\ — 52a(T) 
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has the restricted bi-Lipschitz property (|1.11|) on A, where 
(1.27) 


TF,T{hA) := sup sup 

ccEM zEkA 


\\F(x + z)-F(x)-Tz\\ H2 


F||Hi 


, k > 1 . 


In Section [5] we show that F has the sparse Riesz property (1.22) when 

V2 


7f,t( 2A) <\~ V*2a(T) < \/l ~5 2A (T), 


and the almost linear property (1.23) when 7f,t(4A) < 00 . Therefore signals 
x E M could be reconstructed from their nonlinear measurements F(x) when F is 
not far from a linear operator T with small restricted isometry constant 5oa(T), 
see Theorem 15.41 


2. Iterative Reconstruction Algorithms 


For a Banach/Hilbert space B, we also denote its norm by || ■ || for brevity. In 
this section, we establish exponential convergence of the iterative reconstruction 


algorithms ( 1 . 8 ) and ( 1 . 10 ). 


Theorem 2.1. Let Bi and B 2 be Banach spaces, F be a differentiable map from 
Bi to B 2 with its derivative being continuous and uniformly stable, and let T E 
B(Bi,B 2 ) be bounded below. Assume that ( |1.9| ) holds for some bounded left-inverse 
Tt : B 2 —> Bx of the linear operator T. Given positive relaxation factor y > 0, 
an initial xq E Bi and a noisy observation data z e := F(x°) + e E B 2 for some 
x° E Bi with additive noise e E B 2 , define x n ,n > 1, iteratively by ( |1.8[ ). 
x n ,n > 0, converges exponentially to some Xoo E Bi 


Then 


with 


(2.1) H^oo 2' 0 II — 

provided that 

( 2 . 2 ) 

Moreover, 




inf inf 


I F'{x) 


l-/3 FiT ||T||||Tt|| Wb i0 *, gBi \\Ty\\ 


0 < H < ( sup sup . 

V xeB 1 o^yeB 1 \\Ty\\ > 


-1 


,1 „ / \\T^\\\\F(xo)-z e \\f . f . f \\F'(x)y\\^-i n 

(2.3) \\Xn — Xoo|| < - 7 - xT77TiTTTA=rrn ( 1 5I inf^ — ||rp M —I rtf, n > 1, 


where 


To = 1 — At(l — Pf,t\\T 


l-fellTlIllTtll V^BiO^eBx \\Ty\\ 

F'{x)y\ 


inf inf llm .. 
xeBiO^yeBx ||Ty|| 


e(0,i). 


















QIYU SUN AND WAI-SHING TANG 


Proof. Set a n = J 0 ' \\F , (x n -i+t(x n -x n -i))(x n -x n -i)\\dt,n > 1. Then torn > 1, 
ll^n+l x n || 

= ||(x„ - X n -l) - flT Jf (F(x n ) ~ F(x n -l))\\ 

T ] T{x n - x n -i) 


< 


Xn X n — 1 fJ'CXn 


||T'(x n x n —i) 


/ \\F (Xn— 1 t(x n X n — l))(x n X n _i)|| 

</0 

(^n—1 “1“ t(Xn X n — l))(x n 3?n—l) T(x n X n _i) 


r »( 


■ 1 H“ ^{Xn X n —\)^(x n Xyi — 1) 11 H^X^n X n —l) 

- i 1 - y ||T(x„-"x„-,)l| ) “ X ”- 1 " + '‘“" /Sf ' tI|T ” 11 

< r 0 \\x n -x n -i\\ 


dt 


by (1.8), (1.9) and (2.2). This proves the exponential convergence of x n ,n > 0, to 
its limit Xoo £ Bj. 

Taking limit in (1.8) gives 

(2.4) T\F{ Xoo )-F{x°)) = tU, 

because 

||T t (i ? (z 0O ) - z e ) || < ||T T (F(x 00 ) - F(x n ))|| + ||x n+ i - x n \\/n —> 0, n —> oo. 
Then it follows from ( |1.9| ), \2.2\ and ( |2.4[ ) that 


X — Xn 


fo WF'iXoc + t(x° - X oo ))(x 0 - Xoo) || dt 
\\ T ( x ° - ®oo)|| 

tU f \\F'(x oo + t(x° - x 00 ))(®° - Xoo)II Q Xc °\w dt ) 
y J 0 10 \ x x oo)\\ ' 

< /3 F) t\\T j \\^ J ll-F^Xoc + f(x° - Xoo))(x 0 - x^ 

+ [ Ff xoo + t(x° - Xoo))(x 0 - Xoo)dt 

Jo 

fd \\F'( x <x + t(x° - Xoo)(x° - Xoo)||dt 


< Pf,t\\T 


|| T(x°-x c 


(2.5) 


x Nrc — Xoo II + 


which proves (2.1). 
Observe that 


Xn 


*J| < E |N +1 - x*|| < 


k=n 


1 - T 0 


Then the estimate (2.3) follows. 


□ 
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The iterative algorithm (1.8) in Theorem 2.1 provides a stable reconstruction of 
x E Bi from its noisy observation F(x) + e E Bo when (3 f ,t < (||T||||Tt||)-\ a 
requirement stronger than Pf.t < 1 that guarantees the bi-Lipschitz property for 
the map F, see Theorem A.3|in Appendix [Aj Next we close that requirement gap 


on f$F,T in Hilbert space setting, cf. Theorem | A.7 


Theorem 2 . 2 . Let Hi and H2 be Hilbert spaces, F be a differentiable map from 
Hi to H2 with its derivative being continuous and satisfying (1.2), and let T E 
B(Hi,H 2 ) satisfy ( |1.3| ) and ( |1.6[ ). Given relaxation factor p > 0, an initial uq E 
Hi, and noisy data z e := F(u u ) + e for some u° E Hi with additive noise e E H2, 
define u n ,n > 1, iteratively by (1.10). Then u n ,n > 0, converges exponentially to 
some ttoo E Hi with 

° 2||T|| 

U °° U ~ (2-/3^ T )(inf|| ti || =1 ||Tu||)(inf weHl inf|| u || = i||T 1/ (?;)u||) 
provided that 

(2.7) 0 < p < (2 — /3f T ) 


2 v inf|| u || = i ||Tu||) (inf„ eHl inf M=1 \\F'(v)u\ 


ll T ll 2 ( su P«e Hl ll-F'(v)ll) 

Proof. Define S := T*F. Observe that 

(F\u)v,Tv) = ||F'(uH|||Tu||(l-^ 

2 - (/?f,t) 2 


F’{u)v 


Tv 


> 


\\F'(u)v\ 

1^(^1111^11- 


\\Tv\\ 


Therefore 


(ui - v 2 , S(vi) - S(v 2 )) = {F(v 1 ) - F(v 2 ),T(v 1 - v 2 )) 

[ (F'(v 2 + t(v 1 - v 2 )(vi - v 2 ), T(v 1 - v 2 ))dt 

Jo 


( 2 . 8 ) 


> 


> 


2 — (/ 3f,t y 


2 — (Pf,tY 


(jf || F'(v 2 + t(v 1 -v 2 )(vi - v 2 )\\dt)\\T(vi -u 2 )|| 


inf 

||li||=l 


inf inf ||T , (?;)m|| ) ||vi — ^ 211 2 , 
ueHi ||till =1 7" 


where A is the lower stability bound in (1.2). Also one may easily verify that 
(2.9) H-S'(ui) - S(v 2 )|| < ||T||( sup ||F , (u)||)||ui - v 2 \\, v 1 ,v 2 E Hi. 

«eHi 


Therefore by standard arguments (see for instance ra, we obtain from ( |2.8[ ) and 
||lln+l 11'nll^ o||^ra Uyi —111" , Tl P 1, 


(|2.9|) that 


where 
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This proves the exponential convergence of u n ,n > 0, in the iterative algorithm 


( 1 . 10 ). 


Taking limit in the algorithm (1.10) leads to 
(2.10) T*(F(uoo) — w e ) = 0, 

where rtoo is the limit of the sequence u n ,n > 0. Thus 
2 — (3 2 


J F,T 


( inf ||Tu||)( inf inf ||F , (n)n||) \\uoo - u 


< <«oo - u°,T*(F(u oo) - F(u°))) = (n u - Uoo , T*e) 


01 | 2 


< IITII ||ti° — u, 


by (2.8) and (2.10). This proves (2.6) and completes the proof. 


□ 


3. Iterative algorithm for localized maps 

In this section, we develop a fixed point theorem for a well-localized map on a 
Banach space whose restriction on its dense Hilbert subspace is a contraction, and 


we establish exponential convergence of the iterative algorithm (1.10) for certain 
localized maps between (weak-)Hilbert-dense Banach spaces. 

To state our results, we recall the concept of differential subalgebras. Given two 
unital Banach algebras A\ and A 2 , A\ is said to be a Banach subalgebra of A 2 if 
A\ C Aii Ai and A 2 share the same identity and supo^gyq ll^ll^/ll^ll-Ai < 00 
holds; and a Banach subalgebra A\ of A 2 is said to be a differential subalgebra of 
order 9 £ (0,1] if there exists a positive constant D such that 

(3.1) IPiTiiu < eimiuilTilu(({!{£)' + (g^f) 1 

for all nonzero Ti,T 2 £ Ai P EH E2 ES]. We remark that differential subalge¬ 
bras include many families of Banach algebras of infinite matrices with certain 
off-diagonal decay and localized integral operators |33133 03103 SZl S3 EH HBl, 
and they have been widely used in operator theory, non-commutative geometry, 
frame theory, algebra of pseudodifferential operators, numerical analysis, signal 
processing, control and optimization etc, see 0 Ea E3 US HU 113 S3 HE] , the 
survey papers [291 [36] and references therein. 

Next we define the conjugate T* of a localized linear operator T between (weak-) 
Hilbert-dense Banach spaces. Given Banach spaces B,; and their dense Hilbert sub¬ 
spaces i = 1, 2, we assume that linear operators T reside in a Banach subspace 
B of H(Hi,H 2 ) and also of B(Bi,B 2 ). For a linear operator T £ B, its restriction 
T|h! to Hi is a bounded operator from Hi to H 2 , hence its conjugate (T|hi)* 
is well-defined on H 2 , and the “conjugate” of T is well-defined if the conjugate 
(T| Hl )* can be extended to a bounded operator from B 2 to Bi. The above ap¬ 
proach to define the conjugate requires certain localization for linear operators in 


£>, which will be stated precisely in the next theorem, cf. (3.4). 
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Theorem 3.1. Let Bi and B 2 be Banach spaces, Hi and H 2 be Hilbert spaces 
with the property that for i = 1, 2, H* C Bj, H* is dense in B,; ; and 


(3.2) 


F Bj ^ 

sup -—-— < 00 . 

O^xeHi If|| Hi 


Assume that Banach algebra A with norm || • \\a is a unital Banach subalgebra of 
B( Bi) and a differential subalgebra ofB( Hi) of order 9 E (0,1], Let B be a Banach 
subspace of both B(Hi,H 2 ) and B(B U B 2 ), and B* be a Banach subspace of both 
B(H 2 ,Hi) and B(B 2 , Bi) such that 

(i) ST E A for all S E B* and T E B. Moreover, 

\\ST\\ a 


(3.3) 


o^TezwseB* ||S||b*||T|| b 

(ii) For any T E B and S E B* there exist unique T* E B* and S* E B with the 
property that ||T*||b* = ||T||b, ||<S , *||,g = ||iS'||b* and 

(3.4) ( Tu,w ) = ( u,T*w) and ( Sw,u ) = ( w,S*u ) for all u E Hi and w E H 2 . 

Assume that F is a differentiable map from Bi to B 2 such that its derivative F' is 
continuous and bounded from Bi into B, and 


< 00 . 


(3.5) 


/3f,T = sup sup 

xeBi ugHi 


F'(x)u 

Tu 

||F v (x)u||h 2 

\\ Tu \\n 2 


h 2 


< y/2 


for some linear operator T E B. Take an initial xo E Bi and a noisy observation 
data z e = F(x°) + e for some x° E Bi and e E B 2 , define x n ,n > 1, iteratively by 

(3.6) x n+1 =x n - pT*(F(x n ) - z e ), n > 0, 

where the relaxation factor /a satisfies 

(3.7) 


0 < /a < (2 — fd'prp) 


(in f || u || Hl =i ||Tu|| H 2 )(mf xe Bi inf||u|| Hl =l ll i?/ (^)«llH 2 ) 

( SU P||-l| Hl =l II Tw IIh 2 ) 2 (sUP 3 . 6Bi SUP|| U || \\F'(v)u\\ U2 ) 


and T* E B* is the conjugate operator defined by (3.4). Then x n ,n > 0, converges 
exponentially to some Xoo E Bi with 

(3.8) ||xoo - ic°||B 1 < C1|e|| B2 , 

where C is an absolute positive constant. 


Given a Banach space B, we say that a map G : B —> B is a contraction if there 
exists r E [0,1) such that 

||G(x) - G(y )|| B < r\\x - y|| B for all x,y E B. 

For a contraction G on a Banach space B, the Banach fixed point theorem states 
that there is a unique fixed point x* for the contraction G (i.e., G(x*) = x*), and 
for any initial xo E B. the sequence x n +i = G(x n ),n > 0, converges exponentially 
to the fixed point x* |20j . To prove Theorem |3.1[ we need a hxed point theorem 
for differentiable maps on a Banach space with its derivative being continuous and 
bounded in a differential Banach subalgebra and its restriction on a dense Hilbert 
subspace being a contraction. 
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Theorem 3.2. Let B be a Banach space, H be a Hilbert space such that H C B 
is dense in B and 

(3.9) sup [[ X ’[[ B < oo, 

o^xeH |r|h 

and let A be a Banach subalgebra of B( B) and also a differential subalgebra of 
B( H) of order 6 £ (0,1], If G is a differentiable map on B whose derivative G' is 
continuous and bounded from B into A and there exists r £ [0,1) such that 

(3.10) l|G ,, (x)|| j g( H ) < r for all x £ B, 

then there exists a unique fixed point x* for the map G. Furthermore given any 
initial xo G B. the sequence x n ,n > 0, defined by 

(3.11) x n+ i = G(x n ), n > 0, 
converges exponentially to the fixed point x*. 


Proof. Let x n , n > 0, be as in (3.11). It follows from the continuity of G' in the 
Banach subalgebra A of £>(B) that 

x n+1 -x n = G{x n ) - G{x n -i) 


G (x n _i + t(x n x n —\))dt ) {x n x n _i) 


(3.12) 

Observe that 

(3.13) 
and 

(3.14) 


= : T n (x n - x n -i ), n > 1. 


I-^tiIIb(H) ^ I \\G (x n —i -{-t(x n x n —i)) llg(H)^ — r 


\Tn\U < / \\G'(x n -i + t{x n - x n -i))\\ A dt < M 


where M = sup^gg ||G / (x )||„4 < oo by the assumption on the map G. Set 
b n = sup ||7] +ri _iT/ +n _2 • ■ -Ti\\ a , n> 1. 


l> l 


Then we obtain from (3.1), ( 3.13[ ) and ( 3.14[ ) that 


b 2 n+i < ( sup \\T m \\ A )b2n < Mb 2n 

m> 1 


and 

b 2n < 2D(sup \\T l+n _i ■ ■■T l \\ B{H) ) e (b n ) 2 - B < 2 Dr nd {b n f- 6 


for all n > 1. Thus 

b n < M e °b n - eo < M €0 (2D)r e ^ n ~ eo ^ 2 (b^ n _ eo y 2 ) 2 ~ e 

< M eo+(2 “ 0)ei (2D) 1+(2 “ 0) rf ((?l “ eo)+(n " eo “ 2ei) ^ ) (6 (n _ eo _2 ei )/4) 2 “ 0 

< 

< M^'=o ^ 2 -oy (2D)E^(2-0)V§ StoEj-=i+1 ; 
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where n = Yli=o e *2* with a E {0,1} and q = 1. Therefore 

(3.15) 

if 6 E (0,1), and 

, . . M 

(3.16) 


b n < {(2D) l l^- G \M/r 0 )^/^- e Y° &2(2 ~ e) r n ^ n > 1 


b n < —(2M/r) loE2 ’V n , n > 1 


if 0 = 1. By (3.15) and (3.16), for any r\ E (r, 1) there exists a positive constant 
C such that 


(3.17) 


IT T 

\ J -n- L n i 


■ 2~i ||_4 < Cr™, n > 1. 


Recall that .4, is a Banach subalgebra of B( B). We then obtain from (3.12) and 

(3.17) that 

\\x n +i - Xn\\ B < Cr™ ||xi - x 0 ||b, n > 1, 

which proves the exponential convergence of the sequence x n ,n > 0. 

Finally we prove the uniqueness of the fixed point for the map G. Let x* and x* 
be fixed points of the map G. Then x* — x* is a fixed point of the linear operator 
T := f ( j G'(x* + t(x* — x*))dt E A, because 

(3.18) x* -x* = G(x*) - G(x*) = T(x* - x*). 


Following the argument to prove (3.17), we obtain that lim, woo ||T n ||g( B ) = 0. 
This together with (3.18) implies that x* = x*. the uniqueness of fixed points for 
the map G. □ 


Now we apply Theorem 3.2 to prove Theorem 3.1 
Proof of Theorem \3.1\ Define G : Bi — > Bi by 
(3.19) G{x) = x- fiT*(F(x) - z e ), x E B x . 

Then G is differentiable on Bi and its derivative G'{x) = I — fj,T*F'(x), x E Bi, is 
continuous and bounded in A by the assumption on F and the Banach spaces B 
and B*. Set 


m o = 


2 


and 


Mo = ( 


inf Life 

||Tr||h 2 


inf inf 


\F'{x)u\ 


h 2 


sup 

O^uGHi ll^llHi 


sup sup 

xGBi O^uGHi 


FllHi 

F'(x)u\\h 2 

||«||hi 


Observe that 


I G'(x)\\ a < \\I\\a +p\\T\\b{ sup ||F , (a:)|| B ) 

ieBi 


\\s]Ma 

0^S u S 2 &B 11 11B11 11 B 


sup 


(3.20) 


x 


< oo 
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|G"(x)|| B(Hl ) < \\{I + nT*F'( x^Wb^WI - n 2 (T*F\x)) 2 \\ B{Hl) 
< (1 + Mq^l 2 ) sup 


u 


H, 


2,,2 


(3.21) 


< 


1 + Mq g 


1 + moli 

where the second inequality holds as 


{u, (1 + iiT*F'(x))u ) Hl 
< 1, x € Bi, 


\\(I + ^T*F\x))- l \\ B[ n l) = sup 


MlHi 


o^ugHj \\(I + gT*F'(x))u\\ Hl 

and the third inequality follows from (2.8). Combining the above two estimates 
about G'(x),x E Bi, with Theorem 3.2 proves the exponential convergence of 


x n , n > 0, in Bi. 

Denote by Xoo the limit of x n ,n > 0, in Bi. Then taking limit in the iterative 
algorithm (3.6) yields 

T*F( Xoo ) — T*F(x°) = T*e. 

Thus 

(3.22) A oo (x oo -x 0 )=T*e, 

where Aoo = J 0 * T* F' (x°+t(x QO —x°))dt. Following the argument to prove Theorem 
3.2 and applying ( 3.20| ) and (3.21), there exists a positive constant C r for any 
r E ((1 + Mq/jl 2 /(l + mon), 1) such that 

||(I-Moo) n |U<CVr n , n> 1. 

Thus Aoo is invertible in A and 


(3.23) 


l!(^oo)- 1 IU < aE ll( J -MooHU < MII^IU + c r /{l - r)). 


n=0 


Combining (3.22) and ( 3.23[ ) leads to 

11 Zoo - Z°||bi < ||(^oo) _i 


B(Bi) 


|T*e| 


< ll(^oo)- 1 |UI|r|| B ( 


sup 

O^SeA 


II'S'IIb(b 1 ) 


sup 


II^IIb(b 2 ,Bi)) 


e B, 


This proves the error estimate (3.8). 


□ 


Remark 3.3. Our model of Hilbert-dense Banach spaces in Theorem 3.1 is K P (A), 
the space of p-summable sequences £ P (A), with 2 < p < oo. For that case, expo¬ 
nential convergence of the Van-Cittert algorithm, which is similar to the iterative 
algorithm (3.6) in Theorem 3.1 is established in |48j under slightly different re¬ 


striction on the relaxation factor g. For weak-Hilbert-dense Banach spaces, the 
iterative algorithm (3.6) in Theorem 3.1 still has exponential convergence if opera¬ 
tors in B and B* are assumed additionally to be uniformly continuous in the weak 
topologies of Banach spaces, that is, sup|m| B <i | f(Tx n ) — f(T Xoo )\ -A- 0 for any 
bounded linear functional / on B 2 if x n tends to Xoo in the weak topology of Bi; 
and su P || S |, <:L \g(Sy n ) — g(Sy 00 )\ —> 0 for any bounded linear functional g on Bi 
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if y n tends to y oo in the weak topology of B 2 . We leave the detailed arguments to 
interested readers. 


4. Sparse reconstruction and optimization 
In this section, we show that sparse signals x E A could be reconstructed from 


their nonlinear measurements Fix) via the optimization approach (1.21). 
Theorem 4.1. Let Hi and Ho be Hilbert spaces, M be a Banach space, A = 


Uj 6 /Aj be union of closed linear subspaces 0 / Hi, sa and oa be in (1.18) and 
flri9i ) respectively, and let F be a continuous map from Hi to H 2 normalized 
so that F( 0) = 0. If (A, M,Hi) forms a sparse approximation triple, and if F 
has the sparse Riesz property (1.22) and the almost linear property (1.23) with 
D, (3, 71,72 > 0 satisfying 


(4.1) 


73 := 1 - 2Z?7i - (-D71 + £>72 + /3)y/a A s A > 0, 


then given x° E M and e > 0, the solution x^ of the optimization problem (1.21) 
provides a suboptimal approximation to x°, 


(4.2) 


I ^0 _ 0 [| 

I^M 11 Hi _ 


2 + 8-D72 + 4/1 \ 


73 


)v / ®A (J A,m(x ) + 


(2 + y/ a A s A )D 


and 

(4.3) 


M < 


'2 - 4Z>yi + 2{D"/i + 2 £>72 + /3)y/a A s A \ 
-) oa.m (£ 


73 


73 


Q, 2D __ 

^ + \J S A £- 

73 


To prove Theorem 4.1, we need the following approximation property for sparse 
approximation triples. 

Proposition 4.2. Let (A,M,Hi) be a sparse approximation triple and a A be as 
in G ,19[ ). Then 

(4.4) \\x - x a ,m||hi < gaIMIm, x G M, 

where xa.m is a best approximator of x G M. 

We postpone the proof of the above proposition to Appendix [B] and start the 


proof of Theorem 4.1 


Proof of Theorem \f. 1\ Let x A M := argmin£ gA ||x 0 — £||m be a best approximator 
in A to x°, where the existence follows from the proximinality property of the triple 
(A, M, Hi). Denote by A(x a M ) the linear space in A containing x A M . Then 

(4.5) Xa, m = argmin £gA(x o^ ||x° - x|| M = argmin £eA{ ^ m) ||x° - x|| Hl 


by the common best approximator property (1.14); and 

(4.6) ||x°||m = II^a.mIIm + ll^c 0 — ^a.mIIm = II^a.mIIm + °'a,m(3 ;0 ) 


by the norm splitting properties (1.15) and (1.16). 
























16 


QIYU SUN AND WAI-SHING TANG 


Let < M + ho := argmin^gA^o^ ^ Hx^ — x||m £ A(x a m ) be a best approxima¬ 
tor to in A(x a m)- Then 

(4.7) ^a.m + h 0 = argmin £gA( ^ M) ||x^ - x|| Hl 

and 


(4.8) 


I^mIIm — ll x A,M + ^o||m + II^M X A,M — ^o||m 


by the common best approximator property and norm splitting property of the 
triple (A, M, Hi). 

Set h := — x°. We then obtain from (1.21), ( |4.6| ) and ( |4.8| ) that 

\\h — ^o||m < 


(4.9) 


< 

< 


|| (x° — x’a,m) + (h — ho) || m + <7a,m (^^) 
II^mIIm — II^A.M 4" Mm + cr A,M(2 ;0 ) 

||®°||m — ||®A,M + ^o||m + °A,m(®°) 


M 


+ 2crA.M(^°)- 


Let hi := argmin^ gA J|h — ho — /i||m be a best approximator of h — ho. Then 


(4.10) 


II h — MIh, — II^tIIhi + 11^' — ho — h±\ 


H, - 


(4.11) ||h — /i 0 ||m = ||||m + ||h - ho — /ii||m, 

and 


(4.12) || h — ho~ Li||hi < a/oaII h — /iq||m 


by ( 1.14[ ), ( 1.15[ ) and (|4.4). 

From (1.21), ( |1.23 ), fhi| ) and ( |4.9[ ), it follows that 


l|T(h)|| < \\F(x 0 M )-F(x°)-F(h)\\+e 

< TiINIhx +72^X(^a,m(x°) + ct a ,m(®m)) + £ 

< 7 i||^o||hi + 7 ill^iIIHi + 7 i||^ - ho — hi 11 h x 

T72\/Ta( ( TA.,M (x^) + ||(x° — X A M ) + (h — ho) 11M) + £ 

< 7i II || Hi + 7i ll^i II Hi + (71 + 72 )a/oaII h — ho||M 
+272V / OAC r A,M(®°) + £ 

< 711 | ^0II Hi + 71 11^1 II Hi + (71 + 72 )v^XHMm 

( 4 . 13 ) +2(71 + 272 )yj HaTA.M^ 0 ) + £• 

By the definition of x A M , we have that 


X A,M — ^A(i“ 


)(x°) and x AM + h 0 — P\^ x o )(^m), 


where Pv is the projection operator from Hi to its closed subspace V. Therefore 


( 4 . 14 ) 


ho = P A ( x ° AM )(h). 
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By ( |1.22[ ), ( |4.7[ ), ( |4.10| ) and ( |4.14[ ) we get 

IIMhi 


(4.15) 
and 

(4.16) 


= II^A(x^ M ) (^) 11 Hi < II^IIhi 

< D\\F(h)\\+Py/aX<j AM (h) 

< D\\F(h)\\ +/3y/aX\\h - h 0 \\ M 


II h 


1 Hi = 


P A(fti)( J - P A« jM ))(^)I|h 1 < 

< D\\F(h)\\ +/3^/aX\\h - h 0 \\ M - 


Hi 


Hence by (1.18), ( |4.9| ), ( 4. 13| ) , (4.15) and (4.16), we have 


IIM 


Hi 


(4.17) 

and 


< Dti||/»o||hi + ( D'yi + U72 + /3)\/«a||^o||m + ^7i||^i||hi 
+2(7771 + 2 H 72 + /3)-y/oA (T A,M(-T°) + 77+ 


II h 


1 Hi 


(4.18) 


< 777i||/i 0 ||hi + (-D 71 + H 72 + /3)y/aX\\h 0 \\ M + Z? 7 i 11 /ii 11 Hi 
+2(7771 + 2 H 72 + P)y/aAO’A ) M( X °) + De. 

Combining (4.17) and ( |4. 18 ) and using ||/io||m < \/sa||^o||hi lead to 

2(777! + 27^72 + (3)^/oa(t a,m( x °) + De 


(4.19) 
and 

(4.20) 


\\h 


H, < 


< 


1 Hi + 


1 - 27771 - (7771 + 7772 + P )^/ a A SA 

2(777i + 2.D72 + / 3 )^/oA>a.m(£ 0 ) + Tie 
1 - 2D71 - (7771 + 7772 + / 3 )y/a A s A 


On the other hand, 

||h - h 0 — Tii ||Hi 

(4.21) 


< a/+a|| h - 7io||m < a/oaII^oIIm + ^V^a 17 a,m( x °) 

< %/ a A s A 11 ho 11 Hi + 2 - v /oa<TA,m(.T 0 ) 


by (1.18), (4.9) and (4.12). Therefore the error estimates (4.2) and (4.3) follow 
from dl.ISP , (|4.9|), Q4.19| ), jOol ) and ( |4.21| ). □ 

As a corollary, we have the following result for linear mapping F, cf. [53], Theo¬ 
rem 1 . 1 ] in the classical sparse recovery setting. 


Corollary 4.3. Let M, A, Hi, H 2 be as in Theorem J h l, and let F : Hi 1 — > H 2 be 
linear and have the sparse Riesz property (1.22) with 77 > 0 and (3 E (0, I/a^a^a)- 
Given x° € M and e > 0, the optimization solution of (1.21) satisfies 


and 


ll*M - x ° 


ll*M - x ° 


|Hi < 


|m < 


2 + 4/3 


1 - Py/aASA 


\ f — ( 01 1 (2 + a/®a-‘ , 'a)77 


2 D 


2 + 2/3^/oasa \ , o\ . /— 


where cfam.{ x °) = infieA ||£ — x 0 ||m- 
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5. Sparse Riesz property and almost linear property 


In this section, we consider the sparse Riesz property (1.22) and almost linear 
property (1.23) for nonlinear maps not far from a linear operator with the restricted 
isometry property (1.26). 

Theorem 5.1. Let Hi and H 2 be Hilbert spaces, M be a Banach space, and 
A = Uj e /Aj be a union of closed linear subspaces of Hi. Assume that (A, M, Hi) 
is a sparse approximation triple, and T G B(Hi,H 2 ) has the restricted isometry 
property (1.26) on 2A with <!> 2 a(T) < y/2/2. If F is a continuous map from Hi to 
H 2 with F( 0) = 0 and 

(5.1) 


v/2 


7e,t(2A) < —- \JS 2 a(T), 


then F has the sparse Riesz property (1.22), 

||^(®)||h 2 > (1 - V2(\J 5 2 a(T) + 7e,t( 2A))) ||x|| Hl 

(5.2) -(a/^a (T) + 7 F , t (2A))V^a,m(*), x G M. 
For any x G M, dehne 

(5.3) = xi, M + argmin £eA ||x - x k A M - x\\ M , k> 0, 


with initial x A M = 0. To prove Theorem 
greedy algorithm. 


5.1 


we need convergence of the above 

k > 


Proposition 5.2. Let (A, M, Hi) be a sparse approximation triple. Then x A M , a, ^ 
0, in the greedy algorithm (5.3) converges to x G M, 

(5.4) 


lim \\x A M - x|| M = 0. 

k—> 00 ’ 


We postpone the proof of the above proposition to Appendix B and start the 


proof of Theorem 5.1 


Proof. Take x G M, let x AM ,k > 0, be as in the greedy algorithm ( |5.3[ ). Then 
from Proposition |5.2[ the continuity of F on Hi, and the continuous imbedding of 
M into Hi it follows that 


(5.5) 

Write Uk = x k+1 


A.M X A,M 


lim ||F(2& m ) - F(x) 11h 2 = 0. 

k—¥ 00 

, k > 0. Then Uf. G A, and 


\\F(x)-Tx\\ 


(5.6) 


h 2 < 


< 




h 2 


k =0 


7e,t( 2A) £||« fc ||Hi 


k =0 


by ( |1.26[ ), ( |1.27[ ), ( |5.5[ ) and the assumption F( 0) = 0. 

Ih 2 


- II T(u k - %)||h 2 , where u k = 


Observe that A{Tu k ,Tiik') = || T(u k + u k 
Uk/WiikWnnk > 0. Then 
(5.7) \(Tu k ,Tu k i) - (u k ,u k r )| < (^a^IKIIhJ^a/IIh!, k,k' > 0, 
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by the restricted isometry property (1.26). We remark that in the classical sparse 
recovery setting, the inner product ( Uk,Uk>) between different u k and u k ' is always 
zero, but it may be nonzero in our setting. Hence for K > 1, 

K n K 


\Tx 


K +111 2 _ 

a,mIIh 2 


nE 


Uk 


k =0 


H 2 


K 


^2\\ T Uk\\n 2 + (Tu k ,Tu k /) 


k =0 


0 <k^k'<.K 


< (1 + (^A^)) ll u fc||Hi + ^2 ( u ki u k ' 

k= 0 0 <k^k'<K 

+<$2A (T) ^ H'Wfc||H i II^/c'IIh-i 


0 <fc/fc'<A 


(5.8) 

and similarly 

(5.9) 


- \\r K+1 \\ 2 

II a.mIIh 


K 2 

x + s 2 a(t)( n^fciiHi), 


k =0 


K 


r4SiiH 2 > KailH 1 -^(r)(EiKii Hl ) ■ 


k =0 


Therefore combining (5.8) and (5.9), and then applying (1.17) and (5.4) when 
taking limit as K —> oo, we obtain 

-WT)(£lKI| Hl ) 2 < IITxii^ - ||*||^ < 6 2A (T)(J2 IKI| Hi ) 2 , 

fc >0 fc >0 

which implies that 

(5.10) \\Tx\\n 2 -VS^Jf)J2 II^Hhi - II x IIhi < \\Tx\\u 2 + V^2a{T) ^ IWIh^ 


k>0 


k> 0 


By ( |1.19D , 
(5.11) 


I'WfcII hi < '\fuA.Wuk-l ||Mi k > 1. 


This together with (1.14) and (1.15) implies that 

(5T2) __ 

iwini - ii^oIIhi + ii^iIIhi+ ihfe-iiiM < V2\\x\\ Hi + VOA^A, 


m(tJ 


fc >0 


fc> 2 


Combining (5.6), (5.10) and (5.12) gives 

| H-F'(ic)|| h 2 — ||^||hi | < (v^^ + 7E,T(2A))(^||x|| H i+VaA-A,M(x)). 

Reformulating the above estimates completes the proof of the estimate 
the sparse Riesz property of F. 


for 

□ 


Theorem 5.3. Let Hi, H 2 , M, A, T, F be as in Theorem |5.1| with additional as¬ 
sumption that 7_f,t(4A) < 00 . Then F has the almost linear property on A, 

||F(x) - F(y) - F(x - y) ||h 2 < 27 F , T ( 4 A)||x - y|| H i 

+2(7f,t(2A) + 7f,t(4A)) v /oX(cja : m(-t) + 0A,M(y))- 


(5.13) 
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Proof. Take x, y £ M, and let x k A M and y AAil k>0, be as in the greedy algorithm 
(5.3) to approximate x and y £ M respectively. Write 

\\F(x)~ F{y)-T(x-y)\\u 2 < \\F(x) - F(x 2 AM ) - T(x - ®a,m)IIh 2 

+ \\ F (y) ~ Hvam) ~ T(y — 2 /a,m)IIh 2 

+ II^( x a.m) _ F(Ua,m) _ t ( x a,m ~ 2 /a,m)IIh 2 
(5.14) =: / 1 +/ 2 +/ 3 . 

By (1.19), (1.26), ( |1.27[ ), ( |5.4| ) and the continuity of F and T on Hi, we get 

h < Ell F (4tM)-^A,M)-n4tM-<M)|| H2 

k> 2 


< 7f,t(A) ^2 \\ xl AM ~ ^A.mIIh, 


k> 2 


(5.15) 

and similarly 

(5.16) 


< 7f,t(A) v / «A cr A,M(^) < 7 T’,t(2A) v / OaO'A,m(^) 
h < 7f,t(2A)^X^a,m(2/). 


For the term 13 , we obtain from (1.19) and (1.27) that 
h < 7f,t(4A)||x^ iM — j/a. M IIhi 

< 7f,t(4A)(||x - 2 /|| Hi + \\x - xi iM || Hl + II V - 2 /a,mIIhi) 
(5.17) < 7f,t(4A)(||x — 2 /11 h 1 + v / oa(o'a,mOe) + o"a.m(2/)))- 

Combining estimates in ( |5.14[ )-( 5.17| ) gives 
\\F(x) - F(y)-T(x - y)\\ < 7f,t(4A)||x - y|| Hl 


(5.18) 

Write 


+ (7f,t(2A) + 7f,t(4A)) i/oa^a.mW + ca.m^))- 


\\f{x - y) - T(x - 

< \\F(x — y) — F(x a m ~ V) ~ ~ "T A,Mi 

+ II^ 1 ( X A,M ~ y) - F ( x A,M - Z/a.m) _ T(yA,M - 


~ y) — t{x — x\- 


+ II^’( x a,m — Vam) ~ F(0) - T(x am — 7a,m)II- 

Following the arguments used to establish ( |5. 18 ), we have 
\\ F ( X ~ y)-T(x - y)\\ < 7f,t(4A)||x - y\\ Hl 

(5.19) +(7f,t(2A) + 7f,t(4A)) v / oa(o'a,m(2:) + <Ta,m(2/))- 

Combining (5.18) and (5.19) proves the estimate ( |5.13[ ) for the almost linear prop¬ 
erty of the map F. □ 


Combining Theorems 4.1, 5.1| and 5.3 leads to the following result on the stable 
reconstruction of sparse signals x from their nonlinear measurements F(x) when F 
is not far away from a measurement matrix T with the restricted isometry property 

<p6). 
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Theorem 5.4. Let Hi,M, A, T, F be as in Theorem \5.1\ with 
\/2(\/ 5 2A (T) + 7f,t(2A)) + 47^t(4A) 

$2A(T) + 37f,t(2A) + 4 v ^4a(70)v /ci asa < L 

Then for any given x° £ M and £ > 0, the solution of the minimization problem 
{ 1.21\ has the following error estimates: 

(5.20) ||xm - ®°||hi < C ly /aX(j AM (x°) + C 2 e 
and 

(5.21) II^M — ®°||m < C , 1 °'A,m( 3 ;0 ) + C 2 yfS A e 

where C\ and C 2 are absolute constants independent on x° £ M and e > 0. 


Applying Theorem 5.4 to linear maps, we have the following corollary. 


Corollary 5.5. Let Hi,M, A,H 2 and T be as in Theorem \5.4\ If 

d 2 A{T) < (V2 + yfa A s A ) 2 , 

then for any given x° E M and e > 0, the solution of the minimization problem 
(1.21) with F = T has the error estimates (5.20) and (5.21[). 


For classical sparse recovery problems, the conclusions in Corollary |5.5| have been 
established under weaker assumptions on the restricted isometry constant 5 2A (T), 
see Em and references therein. 


Acknowledgement The first author thanks Professor Yuesheng Xu for his 
invitation to visit Guangdong Province Key Laboratory of Computational Science 
at Sun Yat-sen University, China, where part of this work was done. 


Appendix A. Bi-Lipschitz map and uniform stability 


In this appendix, we provide some sufficient conditions, mostly optimal, for a 
differentiable map to have the bi-Lipschitz property (1.1), see Theorems A.3 and 


A.5 in Banach space setting, and Theorems | A.7 and A.9 in Hilbert space setting. 

For a differentiable map F from one Banach space Bi to another Banach space 
B 2 that has the bi-Lipschitz property (1.1), we have 

|| F(x + ty) - F(x 


A\\y\\ < 


t 


< B\\y\\ for all x,y £ Bi and t > 0, 


where A, B are the constants in the bi-Lipschitz property (1.1). Then taking limit 
as t —> 0 leads to a necessary condition for a differentiable bi-Lipschitz map. 


Theorem A.l. Let Bi and B 2 be Banach spaces. If F : Bi — > B 2 is a differen¬ 
tiable map that has the bi-Lipschitz property (1.1), then its derivative F'(x),x £ Bi, 
has the uniform stability property (1.2). 


For Bi = B 2 = M, a differentiable map F with the uniform stability property 


general Banach space setting. Maps E p e , 1 < p < 00 , e £ [0,7r/4), from M to in 
the example below are such examples. 


(1.2) for its derivative has the bi-Lipschitz property (jTTTj) , but it is not true in 
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Example A.2. For 1 < p < oo and e E [0, vr/4), define E p ^ e : M i—» M 2 by 


(A.l) 


Ep„ e (t ) 


(— cos e, sine) — (sine, cos e)(t + 7r/2 + e) 
if t E (—oo, —7r/2 — e), 
< (sint, — cost) if f E [—7r/2 — e,7r/2 + e], 
(cos e, sine) + ( — sine, cos e)(f — 7r/2 — e) 
if t E (n/2 + e, oo), 


see Figure [l] The maps E Jpt just defined do not have the bi-Lipschitz property 




FIGURE 1. Maps E p from R to M 2 with e = 0 (left) and e = 7 t/ 6 (right). 


^1*1 < iK.w'iip= 


if t < — 7t/ 2 — e 
if \t\ < 7t/ 2 + e 
if t > 7t/ 2 + e 


(1.1), but their derivatives E' pe have the uniform stability property (1.2), 

|| (t sin e,t cose) ||p 
|| (t cos t,t sint) ||p 
|| (—tsine, t cose) | 

< 2|t| for all t, t E M, 

where ||-||p,l<p<oo, is the p-norm on the Euclidean space M 2 . 

Given a differentiable bi-Lipschitz map F from one Banach space Bi to another 
Banach space Bo such that its derivative F' (x) is uniformly stable, define 

[x^y 

(A.2) ap := sup inf sup ,, — 7 , —77 — 2 


sup 

lwll=l 


inf sup 
l 2 ll = l xgBi 


\\F'(x)y\\ 


The quantity oif is the minimal radius such that for any 0 /i/EBi, the set 
of unit vectors F'(x)y/\\F'(x)y\\,x E Bi, is contained in a ball of radius ap < 1 
centered at a unit vector. Our next theorem shows that a differentiable bi-Lipschitz 
map F with its derivative F'{x) being uniformly stable and continuous and with 


ap in (A.2) satisfying of < 1 has the bi-Lipschitz property (1.1). 


Theorem A.3. Let Bi and B 2 be Banach spaces, and F be a continuously differ¬ 
entiable map from Bi to B 2 with the property that its derivative has the uniform 


stability property (1.2). If cxf in (A.2) satisfies 
(A.3) 


CtF < 1 , 
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then F is a bi-Lipschitz map. 
Proof. Given x,y £ Bi with y / 0, 


F{x + y) — F{x) = J F'(x + ty)ydt = \\F'{x + ; 


+ 


where 2 £ B 2 with II 2 II = 1. Thus 


\\F(x + y) - F(x)\\ > 




x (1 — inf sup 

11*11=1 0<t<l 


F'{x + ty)y 


\\F'(x + ty)y\\ 


— z 


> (1 — OCf) 


(jf l IF ' (I+ 


> (1 - ot F )A\\y\\, 


and 


\\F{x + y) — F(x)\\ < / \\F'(x + ty)y\\dt < B\\y\\, 


where A, B are lower and upper stability bounds in the uniform stability property 
(1.2). Combining the above two estimates completes the proof. □ 


Remark A.4. The U-shaped map E p f . in Example A.2 with p = 00 and e = 0 is 
not a bi-Lipschitz map and 




00 ,0 


= inf sup 
IMI°o = l \t\<n/2 


(cos t, sin#) 


= sup 

|t|<7r/2 


max(| cos#|, | sin#|) 
(cos t, sin#) 


— z 


max(| cos#|, | sin#|) 


-( 1 , 0 ) 


= 1 . 


This indicates that the geometric condition (A.3) about a F is optimal. 

For a differentiable map F not far away from a bounded below linear operator 
T, we suggest using Ty/\\Ty\\ as the center of the ball containing the set of unit 
vectors F'(x)y/\\F'(x)y\\, x £ Bi, and define the minimal radius of that ball by 
/3 f ,t in (1.5). Then obviously 

(A.4) ^ Pf,t- 

This together with Theorem A.3| implies that a differentiable map F satisfying 
Pf,t < 1 is a bi-Lipschitz map. 

Theorem A.5. Let Bi and B 2 be Banach spaces, and F be a continuously differ¬ 
entiable map from Bi to B 2 with its derivative having the uniform stability property 
(1.2). 7/T£B(B 1 ,B 2 ) is bounded below and satisfies (|1.4|), then F is a bi-Lipschitz 


map. 
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We may use the following quantity to measure the distance between differentiable 
map F and bounded below linear operator T, 

(A.5) 

\\F(x + y) — F(x) — Ty\\ \\F'(z)y-Ty\\ 

0 F ,T ■= sup sup -——- = sup sup -——-. 

O^eBizeB! W-Ly || O^yeBizeBi \\J-y\\ 

By direct computation, 


a T K x )l 

Pf,t < sup sup ——— 

||vll=i xeB 1 v w* { 


F'{x)y - Ty\\ 


\\\ F '(x)y\\ ~ W|K 2 5 F)T 
" ~ 1 — 5 f,t 


\\F'(x)y\\ 

Thus the geometric condition (1.4) in Theorem A.5| can be replaced by the condition 
5f,t < 1/3. 


Corollary A.6. Let Bi,B 2 ,T and T be as in Theorem A.5. If 5f,t < 1/3, then 
F is a bi-Lipschitz map. 


The geometric condition (1.4) to guarantee the bi-Lipschitz property for the map 
F is optimal in general Banach space setting, as /3 e OOi0 ,t i = 1 for the U-shaped map 
Eoofl in Example A.2 and the linear operator T±t := (t, 0),f 6 M. But in Hilbert 
space setting, as shown in the next theorem, the geometric condition (1.4) could 
be relaxed to /3f,t < \/2. 

Theorem A.7. Let Hi and H 2 be Hilbert spaces, and let F : Hi — > H 2 be a con¬ 
tinuously differentiable map with its derivative having the uniform stability property 
(1.2). If there exists a linear operator T e H(Hi,H 2 ) satisfying ( |1.3[ ) and ( |1.6[ ) ; 
then F is a bi-Lipschitz map. 

Proof. Take «,»£ Hi with v 0. Then 


(A.6) 


\\F(u + v) -F(u)|| < 


f iin 

Jo 


u + tv)v\\dt. < B||u||, 


where B is the upper stability bound in (|1.2|). Observe that 


(F'(u)v,Tv) = \\F'(u)v\\\\Tv\\(l-- 


1 


F'{u)v 


Tv 


Then 


\\F'(u)v\ | |Tu|| 


(F>(u)v,Tv) > 2 il3 n F ' T)2 \\F\u)v\\\\Tv\\, 


which implies that 

(F(u + v) — F(u),Tv) = 


{F’(u + tv)v, Tv)dt 


(A.7) 


> 


> 


2 — (/3 f,tY 
2 

2 - (Pf.tY 


(/ \\F'(u + tv)v\\dtJ\\Tv\\ 


A||Tu|| ||u||, 


where A is the lower stability bound in (1.2). Hence 
(A.8) ||F(u + u)-F(u)|| > 


{F{u + v) -F(u),Tv) > 2-{P F} t) 


ll^ll 


-A||u||. 


2 
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Combining (A.6) and (A.8) proves the bi-Lipschitz property for F. 


□ 


E Pt6 in Example |A.2| with p = 2 and e = 0, 


Remark A.8. The geometric condition (|1.6|) is optimal as for the U-shaped map 

M) 


(A.9) Pe 2 ,o,Ti = sup sup 

t^O \t\<ir/2 

where Tit = (t, 0), t 6 M. 

Define 


(tcost, tsinf) 


(cos 2 1 + sin 2 ty/ 2 \t\ |t| 


= V 2 


f (F (u)v, Tv) \ 
e F)T = sup arccos , 

«G Hlv^o MI-FfaMIM'' 

the maximal angle between vectors F'{u)v and Tv in the Hilbert space H 2 . Then 

P f,t = 2 sin —^. 


So the geometric condition (1.6) can be interpreted as that the angles between 
F'(u)v and Tv are less than or equal to Of.t £ [0, vr/2) for all u, v E Hi. The 


above equivalence between the geometric condition (1.6) and the angle condition 


9 f ,t < 7t/ 2, together with (1.2) and (1.3), implies the existence of positive constants 
Ai, B\ such that 

(A.10) Ai||Tu|| 2 < {F'{u)v,Tv) < Hi||ru|| 2 , u,v E Hi. 

The converse can be proved to be true too. Thus (3 f ,t < if an d only if S := T*F 
is strictly monotonic. Here a bounded map S on a Hilbert space H is said to be 
strictly monotonic Ea if there exist positive constants m and M such that 

rail'll — w|| 2 < (u — v, S(u ) — S(v)} < M\\u — u|| 2 for all u, v G H. 


As an application of the above equivalence, Theorem A.7 can be reformulated as 
follows. 

Theorem A.9. Let Hi and H 2 be Hilbert spaces, and let F : Hi — > Ho be a con¬ 
tinuously differentiable map with its derivative having the uniform stability property 


(1.2). If there exists a linear operator TeB( Hi,H 2 ) satisfying (|1.3[) and (|A.10[) ; 


then F is a bi-Lipschitz map. 


From Theorem A.9 we obtain the following result similar to the one in Corollary 

El 


Corollary A.10. Let Hi,H 2 and F be as in Theorem A. 
bounded below linear operator T E £>(Hi,H 2 ) with 5 Fj t < 
bi-Lipschitz map. 


Xg[ 

cV2 


If there exists a 
— 1, then F is a 


Given a differentiable map F, it is quite technical in general to construct linear 


operator T satisfying (1.3) and (1.4) in Banach space setting (respectively (1.3) 
and (A.10) in Hilbert space setting). A conventional selection is that T = F'(xq) 
for some .To G Bi, but such a selection is not always favorable. Let T = (0 a)asA be 
impulse response vector with its entry (j>\ being the impulse response of the signal 
generating device at the innovation position A E A, and T = (^ 7 ) 7g p be sampling 
functional vector with entry ^ 7 reflecting the characteristics of the acquisition 
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device at the sampling position 7 G T. In order to consider bi-Lipschitz property 
of the nonlinear sampling map 

: f(A) 3X^X T <Z> f{x T^ sampling * } € f {T) 

related to instantaneous companding h[t) 1 —> f(h(t)), a linear operator 
T := (A<i>^) A^^) A<i> (Ai^xj/) 


satisfying (1.3) and (A. 10) is implicitly introduced in [28], where 


Vb^AeA^er 

is the inter-correction matrix between <b and T. 


Appendix B. Sparse approximation triple 


In this appendix, we prove Propositions 5.2 and 4.2 and conclude it with a 


remark on the greedy algorithm (5.3). 


Proof of Proposition\5.S\ The convergence of x \^ M , k > 0, follows from 


K 

£ 

k =0 


l x AJVI — x a,mIIm = IMIm — II® — x A,mIIm < ||x|| M , K > 0, 


by the norm splitting property (1.15). Denote by G M the limit of x\ M , k > 

.OO " " ' ‘ 

A,M 


0. Then the limit satisfies the following consistency condition: 


(B.l) 


(®A,M> y) = (x,y) 


for all y G A. The above consistency condition holds as 0 = argmin £gA ||x —x^ M — 
s|| M , which together with the norm-splitting property (1.14) in Hi implies that 


the projection of x — x AM onto A,; are zero for all i G /. From the consistency 


condition (B.l), we conclude that (B.l) hold for all y G kA.k > 0, and hence for 


all y in the closure of Ufc>oA;A. This together with the sparse density property of 
the sparse approximation triple (A, M, Hi) proves the convergence of x A M , k > 0, 
to x G M. □ 


Proof of Proposition ^. 4 Take O/ 16 M and let x A M , k > 0, be as in the greedy 
algorithm ( |5.3[ ). Write Uk = — x A M , k > 0. Thus 


u k = argmin £gA ||x - x AM - x|| M = argmin £gA ||(x - x A - u k -1 - x||m £ A, 


and x - ® a,m = J2k> 1 
implies that 


Uk by Proposition 


5.2 


This together with (1.15) and (1.19) 


|x — X A ,m||h 1 < ^ 


H, < 


-y/flA F. Il^fc—1 IIm = Voa\\x\ 


M- 


k> 1 


k> 1 


This completes the proof. 


□ 
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Given a sparse approximation triple (A, M, Hi), we say that x E M is com¬ 
pressible ([HISS, 2£S 32]) if {cTfcA.M(^)}/iLi having rapid decay, such as 

07 , : a : m(t) < Ck~ a for some C, a > 0, 
where (T k A,M.{x) is the best approximation error of x from kA, 

(B.2) a k a.m(x) := inf \\x - x||m, k > 1. 

x£kA 


For the sequence x A M , k > 0, in the greedy algorithm (5.3), we have 


(B.3) 


! -'A,M 


— x||m > °A,-A,m(‘T), 


as x k A M E kA,k > 1. The above inequality becomes an equality in the classical 
sparse recovery setting. We do not know whether and when the greedy algorithm 
(5.3) is suboptimal, i.e., there exists a positive constant C such that 


(B.4) 


l a,m - x \\m < Ca k am(x), x e M, 


even for compressible signals. The reader may refer to 
greedy algorithms. 


for the study of various 


References 

[1] A. Aldroubi, Q. Sun and W.-S. Tang, p-frames and shift invariant subspaces of L p , J. Fourier 
Anal. Appl, 7(2001), 1 -21. 

[2] R. Balan, B. Bodmann, P. Casazza and D. Edidin, Painless reconstruction from magnitudes 
of frame coefficients, J. Fourier Anal. Appl, 15(2009), 488-501. 

[3] R. Balan, P. G. Casazza and D. Edidin, On signal reconstruction without noisy phase, Appl. 
Comput. Harmon. Anal, 20(2006), 345-356. 

[4] A. S. Bandeira, J. Cahill, D. G. Mixon and A. A. Nelson, Saving phase: injectivity and 
stability for phase retrieval, Appl. Comput. Harmon. Anal, 37(2014), 106-125. 

[5] A. Beck and Y. C. Eldar, Sparsity constrained nonlinear optimization: optimality conditions 
and algorithms, SIAM J. Optim., 23(2013), 1480-1509. 

[6] B. Blackadar and J. Cuntz, Differential Banach algebra norms and smooth subalgebras of 
(T-algebras, J. Operator Theory, 26(1991), 255-282. 

[7] T. Blumensath and M. E. Davies, Sampling theorems for signals from the union of finite¬ 
dimensional linear subspaces, IEEE Trans. Inform. Theory, 55(2009), 1872-1882. 

[8] T. Blumensath and M. E. Davies, Sampling and reconstructing signals from a union of linear 
subspaces, IEEE Trans. Inform. Theory, 57(2011), 4660-4671. 

[9] T. Blumensath, Compressed sensing with nonlinear observations and related nonlinear op¬ 
timization problems, IEEE Trans. Inform. Theory, 59(2013), 3466-3474. 

[10] J. M. Borwein and S. Fitzpatrick, Existence of nearest points in Banach spaces, Can. J. 
Math., 61(1989), 702-720. 

[11] T. Cai and A. Zhang, Sharp RIP bound for sparse signal and low-rank matrix recovery, 
Appl. Comput. Harmon. Anal, 35(2013), 74-93. 

[12] E. J. Candes, J. K. Romberg and T. Tao, Stable signal recovery from incomplete and inac¬ 
curate measurements, Comm. Pure Appl. Math., 59(2006), 1207-1223. 

[13] E. J. Candes, T. Strohmer and V. Voroninski, PhaseLift: Exact and stable signal recov¬ 
ery from magnitude measurements via convex programming, Comm. Pure Appl. Math., 
66(2013), 1241 1274. 

[14] E. J. Candes and T. Tao, Decoding by linear programming, IEEE Trans. Inform. Theory, 
51(2005), 4203-4215. 

[15] P. G. Casazza, D. Han and D. R. Larson, Frames for Banach spaces, In The Functional and 
Harmonic Analysis of Wavelets and Frames (San Antonio, TX, 1999), Contemp. Math., 
247, Amer. Math. Soc., Providence, RI, 1999. pp. 149-182. 




28 


QIYU SUN AND WAI-SHING TANG 


[16] P. G. Casazza and G. Kutyniok and S. Li, Fusion frames and distributed processing, Appl. 
Comput. Harmon. Anal, 25(2008), 114-132. 

[17] O. Christensen An Introduction to Frames and Riesz Bases, Birkhauser Boston Inc., 2003. 

[18] O. Christensen and D. T. Stoeva, p-frames in separable Banach spaces, Adv. Comput. Math., 
18(2003), 117-126. 

[19] O. Christensen and T. Strohmer, The finite section method and problems in frame theory, 
J. Approx. Theory, 133(2005), 221-237. 

[20] J. Dieudonne Foundations of Modem Analysis, Academic Press, 1969. 

[21] T. G. Dvorkind, Y. C. Eldar and E. Matusiak, Nonlinear and nonideal sampling: theory 
and methods, IEEE Trans. Signal Process., 56(2008), 5874-5890. 

[22] M. Elder, M. Fornasier and J. Sigl, Quasi-linear compressed sensing, Multiscale Model. 
Simul., 12(2014), 725-754. 

[23] Y. C. Eldar and G. Kutyniok, Compressed Sensing: Theory and Applications, Cambridge 
University Press, 2012. 

[24] Y. C. Eldar and S. Mendelson, Phase retrieval: stability and recovery guarantees, Appl. 
Comput. Harmon. Anal, 36(2014), 473-494. 

[25] Y. C. Eldar and M. Mishali, Robust recovery of signals from a structured union of subspaces, 
IEEE Trans. Inform. Theory, 55(2009), 5302-5316. 

[26] S. Foucart and H. Rauhut, A Mathematical Introduction to Compressive Sensing, Springer, 
2013. 

[27] S. Foucart and M.-J. Lai, Sparsest solutions of underdetermined linear systems via I q - 
minimization for 0 < q < 1, Appl. Comput. Harmon. Anal., 26(2009), 395-407. 

[28] J. Gevirtz, Injectivity in Banach spaces and the Mazur-Ulam theorem on isometries, Trans. 
Amer. Math. Soc., 274(1982), 307-318. 

[29] K. Grochenig, Wiener’s lemma: theme and variations, an introduction to spectral invariance 
and its applications, In Four Short Courses on Harmonic Analysis: Wavelets, Frames, Time- 
Frequency Methods, and Applications to Signal and Image Analysis edited by P. Massopust 
and B. Forster, Birkhauser, 2010. pp. 175-234. 

[30] K. Grochenig and A. Klotz, Noncommutative approximation: inverse-closed subalgebras and 
off-diagonal decay of matrices, Constr. Approx., 32(2010), 429-466. 

[31] K. Grochenig and M. Leinert, Symmetry of matrix algebras and symbolic calculus for infinite 
matrices, Trans. Amer. Math. Soc., 358(2006), 2695-2711. 

[32] P. Hall and J. Jin, Innovated higher criticism for detecting sparse signals in correlated noise, 
Ann. Statist., 38(2010), 1686-1732. 

[33] S. Jaffard, Properietes des matrices bien localisees pres de leur diagonale et quelques appli¬ 
cations, Ann. Inst. Henri Poincare, 7(1990), 461-476. 

[34] K.-L. Jun and D.-W. Park, Almost Linearity of e-Bi-Lipschitz Maps between Real Banach 
Spaces, Proc. Amer. Math. Soc., 124(1996), 217-225. 

[35] E. Kissin and V. S. Shulman, Differential properties of some dense subalgebras of C*- 
algebras, Proc. Edinburgh Math. Soc., 37(1994), 399-422. 

[36] I. Krishtal, Wiener’s lemma: pictures at exhibition, Revista Union Matematica Argentina, 
52(2011), 61-79. 

[37] M. V. Klibanov, P.E. Sacks and A.V. Tikhonravov, The phase retrieval problem, Inverse 
problems, 11(1995), 1-28. 

[38] K.-S. Lau, On a sufficient condition for proximity, Trans. Amer. Math. Soc., 251(1979), 
343-356. 

[39] Y. M. Lu and M. N. Do, A theory for sampling signals from a union of subspaces, IEEE 
Trans. Signal Processing, 56(2008), 2334-2345. 

[40] N. Motee and A. Jadbabaie, Optimal control of spatially distributed systems, IEEE Trans. 
Automatic Control, 53(2008), 1616-1629. 

[41] M. Z. Nashed and Q. Sun, Sampling and reconstruction of signals in a reproducing kernel 
subspace of L p (R d ), J. Fund. Anal, 258(2010), 2422-2452. 

[42] H. Rauhut and R. Ward, Interpolation via weighted 11 minimization, Appl. Comp. Harmon. 
Anal, accepted. 



NONLINEAR FRAMES AND SPARSE RECONSTRUCTIONS IN BANACH SPACES 


29 


[43] M. A. Rieffel, Leibniz seminorms for “matrix algebras converge to the sphere”, In Quanta 
of Maths, Volume 11 of Clay Math. Proc., Amer. Math. Soc., 2010, pp. 543-578. 

[44] C. E. Shin and Q. Sun, Stability of localized operators, J. Fund. Anal., 256(2009), 2417- 
2439. 

[45] Q. Sun, Wiener’s lemma for infinite matrices with polynomial off-diagonal decay, C. Acad. 
Sci. Paris Ser I, 340(2005), 567-570. 

[46] Q. Sun, Non-uniform sampling and reconstruction for signals with finite rate of innovations, 
SIAM J. Math. Anal., 38(2006/07), 1389-1422. 

[47] Q. Sun, Wiener’s lemma for infinite matrices, Trans. Amer. Math. Soc., 359(2007), 3099- 
3123. 

[48] Q. Sun, Localized nonlinear functional equations and two sampling problems in signal pro¬ 
cessing, Adv. Comput. Math., 40(2014), 415-458. 

[49] Q. Sun, Wiener’s lemma for localized integral operators, Appl. Comput. Harmon. Anal., 
25(2008), 148-167. 

[50] Q. Sun, Wiener’s lemma for infinite matrices II, Constr. Approx., 34(2011), 209-235. 

[51] Q. Sun, Sparse approximation property and stable recovery of sparse signals from noisy 
measurements, IEEE Trans. Signal Processing, 10(2011), 5086-5090. 

[52] Q. Sun, Recovery of sparsest signals via ^-minimization, Appl. Comput. Harmon. Anal., 
32(2012), 329-341. 

[53] W. Sun, G-frames and g-Riesz bases, J. Math. Anal. Appl., 322(2006), 437-452. 

[54] V. Temlyakov, Greedy Approximation, Cambridge University Press, 2011. 

[55] E. Zeidler and P. R. Wadsack, Nonlinear Functional Analysis and its Applications, Vol. 1, 
Springer-Verlag, 1998. 

Qiyu Sun: Department of Mathematics, University of Central Florida, Orlando, 
FL 32816, USA. Email: qiyu.sun@ucf.edu 

Wai-shing Tang, Department of Mathematics, National University of Singapore, 
Singapore 119076 Republic of Singapore. Email: mattws@nus.edu.sg 



